Batch Evaluation Metrics in Information Retrieval: Measures, Scales, and Meaning

نویسندگان

چکیده

A sequence of recent papers, including in this journal, has considered the role measurement scales information retrieval (IR) experimentation, and presented argument that (only) uniform-step interval should be used. Hence, it been argued, well-known metrics such as reciprocal rank, expected normalized discounted cumulative gain, average precision, either discarded tools, or adapted so their metric values lie at uniformly-spaced points on number line. These papers paint a rather bleak picture past decades IR evaluation, odds with community’s overall emphasis practical experimentation measurable improvement. Our purpose work is to challenge pessimistic assessment. In particular, we argue mappings from categorical ordinal data sets line are valid provided there an external reason for each target point have selected. We first consider general scales, categorical, ordinal, interval, ratio, absolute collections. connection two those categories also provide examples knowledge captured represented by numeric real Focusing then retrieval, document rankings data, effectiveness single value summarizes usefulness user population users any given ranking, able continuous variable ratio scale. That is, most current well-founded, and, moreover, more meaningful form than proposed “intervalized” versions.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correlation and Prediction of Evaluation Metrics in Information Retrieval

Because researchers typically do not have the time or space to present more than a few evaluation metrics in any published study, it can be difficult to assess relative effectiveness of prior methods for unreported metrics when baselining a new method or conducting a systematic meta-review. While sharing of study data would help alleviate this, recent attempts to encourage consistent sharing ha...

متن کامل

Meaning in philosophy and meaning in information retrieval (IR)

Purpose -The paper explores the question of whether the differences between meaning in philosophy and meaning in information retrieval (IR) have implications for the use of philosophy in supporting research in IR. Design/methodology/approach Conceptual analysis and literature review. Findings There are some differences in the role of meaning in terms of purpose, content and use which should be ...

متن کامل

Ranking Metrics and Evaluation Measures

In this work, we present a general guideline to establish the relation between a distribution model and its corresponding similarity estimation. A rich set of distance metrics, such as Harmonic distance and Geometric distance, is derived according to Maximum Likelihood theory. These metrics can provide a more accurate model than the conventional Euclidean distance and Manhattan distance. Becaus...

متن کامل

Two Axioms for Evaluation Measures in Information Retrieval

In this paper evaluation measures for information retrieval system outputs are investigated from a measurement theoretic point of view. Two axioms are introduced: the axiom of monotonicity and the Archimedian axiom. It is shown that the measures fullfilling these axioms are exactly the measures equivalent to some measure of the form ~a + ~d where a is the number of relevant retrieved documents ...

متن کامل

Meaning-Focused and Quantum-Inspired Information Retrieval

8:30 am 9:00 am Coffee break 9:00 am 9:30 am Conference opening and introduction Derek Raine (Physics); Peter Jackson and Emmanuel Haven (Management) 9:30 am 10:30 am Plenary Talk Professor Edward Nelson Department of Mathematics Princeton University Title of Talk: Stochastic mechanics of particles and fields 10:30 am 11:00am Coffee break 11:00 am 12:30 pm Paper session I.: Meaning. Session Cha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3211668